ADABOOST ENSEMBLE ALGORITHMS FOR BREAST CANCER CLASSIFICATION

نویسندگان

Morufat Gbolagade Department of Physical Sciences, Computer Science Programme, Al-Hikmah University, P.M.B 1601, Adewole Housing Estate, Ilorin, Kwara State, Nigeria

Moshood Hambali Computer Science Department, Federal University Wukari, P.M.B 1020, Katsina-Ala Road, Wukari, Taraba State, Nigeria

Tinuke Oladele Department of Computer Science, University of Ilorin, P.M.B. 1515, Ilorin-Nigeria

Yakub Saheed Department of Physical Sciences, Computer Science Programme, Al-Hikmah University, P.M.B 1601, Adewole Housing Estate, Ilorin, Kwara State, Nigeria

چکیده مقاله:

With an advance in technologies, different tumor features have been collected for Breast Cancer (BC) diagnosis, processing of dealing with large data set suffers some challenges which include high storage capacity and time require for accessing and processing. The objective of this paper is to classify BC based on the extracted tumor features. To extract useful information and diagnose the tumor, an Adaboost ensemble Model is developed. In this research work, both homogeneous and heterogeneous ensemble classifiers (combine two different classifiers together) were implemented, and Synthetic Minority Over-Sampling Technique (SMOTE) data mining pre-processing is used to deal with the class imbalance problem and noise in the dataset. In this paper, the proposed method is of two steps. The first step employs SMOTE to reduce the effect of data imbalance in the dataset. The second step involves classifying using decision algorithms (ADTree, CART, REPTree and Random Forest), Naïve Bayes and their Ensembles. The experiment was implemented on WEKA Explore (Weka 3.6). Experimental results shows that Adaboost-Random Forest classify better than other classification algorithms with 82.52% accuracy, follow by Adaboost-REPTree and Adaboost-CART with 77.62% accuracy while Adaboost-Naïve Bayes classifications is the lowest with 35.66% accuracy.

Download for Free

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Breast Cancer Survivability via AdaBoost Algorithms

The use of data mining approaches in medical domains is increasing rapidly. This is mainly because the effectiveness of these approaches to classification and prediction systems has improved, particularly in relation to helping medical practitioners in their decision making. This type of research has become important for finding ways to improve patient outcomes, reduce the cost of medicine, and...

متن کامل

An Ensemble Classification Model for the Diagnosis of Breast Cancer Using Stacked Generalization

Introduction: Breast cancer is one of the most common types of cancer whose incidence has increased dramatically in recent years. In order to diagnose this disease, many parameters must be taken into consideration and mistakes are possible due to human errors or environmental factors. For this reason, in recent decades, Artificial Intelligence has been used by medical practitioners to diagnose ...

متن کامل

An Ensemble Classification Model for the Diagnosis of Breast Cancer Using Stacked Generalization

متن کامل

Ensemble classification in steganalysis – Cross-validation and AdaBoost

Two alternative designs to the ensemble classifier proposed in [13] are studied in this report. First, the out-of-bag error estimation is replaced with crossvalidation. Second, we incorporate AdaBoost and modify the weights of the individual training samples as the training progresses. The final decision is formed as a weighted combination of individual predictions rather than through majority ...

متن کامل

Classification Ensemble by Genetic Algorithms

Different classifiers with different characteristics and methodologies can complement each other and cover their internal weaknesses; Thus Classifier ensemble is an important approach to handle the drawback. If an automatic and fast method is obtained to approximate the accuracies of different classifiers on a typical dataset, the learning can be converted to an optimization problem and genetic...

متن کامل

Adaboost Ensemble Classifiers for Corporate Default Prediction

This study aims to show a substitute technique to corporate default prediction. Data mining techniques have been extensively applied for this task, due to its ability to notice non-linear relationships and show a good performance in presence of noisy information, as it usually happens in corporate default prediction problems. In spite of several progressive methods that have widely been propose...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

عنوان ژورنال

Journal of Advances in Computer Research

دوره 10 شماره 2

صفحات 1- 10

تاریخ انتشار 2019-05-01

دنبال کردن

لغو دنبال کردن

{@ msg @}

با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.

کلمات کلیدی

breast cancer Adaboost Synthetic minority over sampling technique Random forest Ensemble

میزبانی شده توسط پلتفرم ابری doprax.com